SUMMARY

STOCHASTIC APPROXIMATION WITH CORRELATED DATA

New almost sure convergence results are developed for a special form of the multidimensional Robbins-Monro (RM) stochastic approximation procedure. The special form treated can be viewed as a stochastic approximation to the solution $w = w_o = R^{-1}P$ of the linear equations $Rw = P$, where $R$ is a $p \times p$ positive definite symmetric matrix. This special form commonly arises in adaptive signal processing applications. Essentially, previous convergence results for the RM procedure contain a common "conditional expectation condition" which is extremely difficult (if not impossible) to satisfy when the "training data" is a correlated sequence. In contrast, the new convergence results incorporate moment conditions and covariance function decay rate conditions. The ease with which these results can be applied in many cases is illustrated.


1. Introduction. Consider the set of linear equations $Rw = P$, where $R$ is a $p \times p$ symmetric positive definite matrix, and $w$ and $P$ are $p \times 1$ matrices. In case $R$ and $P$ are unknown, and the solution, $w = w_o = R^{-1}P$, is desired, many techniques are available for finding an estimate of $w_o$. In many adaptive signal processing applications, a recursive, computationally efficient procedure for estimating $w_o$ is an important issue. A suitable multidimensional version of the Robbins-Monro (RM) stochastic approximation procedure (Robbins and Monro (1951)) for recursively estimating $w_o$ is given by

(1) $W_{n+1} = W_n + \mu_n (P_n - F_n W_n)$, $n \geq 1$,

where $\{P_n\}$ is a sequence of random variables, $P_n \in R^p$, $\{F_n\}$ is a sequence of random $p \times p$ matrices, $\{\mu_n\}$ is a sequence of positive constants, and $W_1 \in R^p$ is arbitrary. It is assumed that $P_n$ and $F_n$ are functionally independent of $W_1, W_2, \ldots$. It is somewhat helpful to consider $w_o$ to be the vector which minimizes $\xi(w) = w'Rw - 2w'P$, where $'$ denotes matrix transpose. Interpreting $F_n$ and $P_n$ as "instantaneous estimates" of $R$ and $P$, respectively, the relationship between (1) and (deterministic) steepest descent procedures is obvious. Consequently, the family of algorithms represented by (1) has an interpretation as a family of "stochastic gradient-following" algorithms.

Algorithm (1) provides a suitable framework for the analytical treatment of many of the algorithms that have been proposed in the engineering literature for adaptive signal processing applications (e.g., see Sakrison (1966) or Farden (1975)). For such applications, any "conditional expectation assumption" is extremely difficult (if not impossible) to establish. Such conditions are commonly required by existing convergence theorems treating the RM procedure. For example, see Schmetterer (1961, 1969), Sakrison (1966), or Farden (1975) for a discussion of existing results. The practical application of existing convergence results for the RM procedure to (1) essentially requires that $\{P_n - F_n w\}$ is an uncorrelated sequence for every fixed parameter $w \in R^p$. The special form of the RM procedure represented by (1) enables us to obtain convergence results which make use of no such conditional expectation requirements and have a decidedly different flavor than existing results for the RM procedure.
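To make recursion (1) concrete, the following is a minimal sketch rather than a procedure taken from the paper. It assumes the common instantaneous choices $F_n = X_nX_n'$ and $P_n = s_nX_n$, a hypothetical target vector `w_true`, training vectors built from a scalar AR(1) series (so successive $X_n$ are correlated), and the step size $\mu_n = 2/(n+20)$. All names and numerical values are illustrative assumptions.

```python
import random


def rm_recursion(xs, ss, w1, step):
    """Iterate W_{n+1} = W_n + mu_n (P_n - F_n W_n) with F_n = X_n X_n', P_n = s_n X_n."""
    w = list(w1)
    for n, (x, s) in enumerate(zip(xs, ss), start=1):
        mu = step(n)
        # For this choice of F_n and P_n, P_n - F_n W_n = X_n (s_n - X_n' W_n).
        residual = s - sum(xi * wi for xi, wi in zip(x, w))
        w = [wi + mu * residual * xi for wi, xi in zip(w, x)]
    return w


def simulate(n_steps=20000, seed=0):
    """Generate correlated training data and run the recursion on it."""
    rng = random.Random(seed)
    w_true = [1.0, -0.5]  # hypothetical target; equals R^{-1} P for this data model
    x_prev, x_cur = 0.0, 0.0
    xs, ss = [], []
    for _ in range(n_steps):
        # Scalar AR(1) series; X_n stacks two consecutive values, so {X_n} is correlated.
        x_prev, x_cur = x_cur, 0.5 * x_cur + rng.gauss(0.0, 1.0)
        x = [x_cur, x_prev]
        s = w_true[0] * x[0] + w_true[1] * x[1] + 0.1 * rng.gauss(0.0, 1.0)
        xs.append(x)
        ss.append(s)
    w = rm_recursion(xs, ss, [0.0, 0.0], lambda n: 2.0 / (n + 20))
    return w_true, w
```

Calling `simulate()` returns the assumed target together with the final iterate, so the two can be compared directly. The offset in the step size is only a convenience that keeps the early steps small.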

The contents and organization of this paper are as follows. Notation and basic assumptions are presented in Section 2. The framework presented in Assumption (2.1) establishes that the sequences $\{F_n\}$ and $\{P_n\}$ are such that the "time averages" of $E(F_n)$ and $E(P_n)$, respectively, are equal to $R$ and $P$. The generalization resulting from these definitions is applicable to cases where $E(F_n)$ and $E(P_n)$ are periodic, such as occurs in some adaptive digital communication applications.

In Section 3, it is shown that Assumptions (2.1) through (2.7) are sufficient for the a.s. convergence of $W_n$ to $w_o$. The proofs of Lemma (3.1) and Theorem (3.2) below are quite similar in spirit to the proofs of Theorem (6.1) and Theorem (6.3) of Albert and Gardner (1967), respectively. However, the seemingly less restrictive assumptions made in the present work, the simplification in proof resulting for symmetric $F_n$, and the basic differences in the form of algorithms treated here are offered as justification for including the results of Section 3 in this paper. Furthermore, in contrast with the assumptions made by Albert and Gardner (1967), the form of Assumptions (2.3)-(2.7) permits us to exploit the Borel-Cantelli Lemma and the results of Serfling (1970) to prove Corollary (4.5). Corollary (4.5) provides easily verified sufficient conditions for Assumptions (2.3)-(2.7), and hence, for the a.s. convergence of $W_n$ to $w_o$. Several special cases of (1) are considered in Section 5 to illustrate the application of these results. In case $F_n$ and $P_n$ are strongly consistent estimates of $R$ and $P$, respectively, the much simpler convergence
result of Section 6 is applicable.

2. Notation and basic assumptions. The norm of a $p \times p$ matrix $A$, denoted by $\|A\|$, is defined here by $\|A\| = \sup_{|x|=1} |Ax|$, $x \in R^p$, where $R^p$ denotes p-dimensional Euclidean space and $|x|$, $x \in R^p$, denotes the usual p-dimensional Euclidean norm. For $A$ real and symmetric, $\|A\| = \max_i \{|\lambda_i(A)|\}$, where $\{\lambda_i(A)\}_{i=1}^{p}$ are the $p$ eigenvalues of $A$. The minimum and maximum eigenvalues of a $p \times p$ matrix $A$ are denoted by $\lambda_{\min}(A)$ and $\lambda_{\max}(A)$, respectively. The element of a $p \times p$ matrix $A$ occurring in the ith row and jth column of $A$ is denoted by $(A)_{i,j}$. Similarly, the ith element of $x \in R^p$ is denoted by $(x)_i$. The trace of a $p \times p$ matrix $A$ is denoted by $\mathrm{tr}(A) = \sum_{i=1}^{p} (A)_{i,i}$. The symbol 0 is used to denote the additive identity for $R^1$, $R^p$, or to denote a $p \times p$ matrix of zeros. Square brackets [ ] are used to denote integer part. Finally, subscripted variables like $\nu_k$, $n_\ell$, $\ell_i$, etc. are sometimes denoted by $\nu(k)$, $n(\ell)$, $\ell(i)$, etc.

All random variables are assumed to be defined on a probability space $(\Omega, \mathcal{F}, \mathcal{P})$. All relationships between random variables are to be interpreted to hold with probability one.

It is worth emphasizing that Corollary (4.5) below establishes sufficient conditions for Assumptions (2.3)-(2.7).

(2.1) ASSUMPTION. The sequence $\{W_n\}_{n=1}^{\infty}$, $W_n \in R^p$, satisfies the recursion

(1) $W_{n+1} = W_n + \mu_n (P_n - F_n W_n)$, $n \geq 1$,

where $\{P_n\}$ is a sequence of random variables, $P_n \in R^p$, $\{F_n\}$ is a sequence of real symmetric nonnegative definite ($p \times p$) random matrices, $\{\mu_n\}$ is a sequence of positive real numbers, and $W_1 \in R^p$ is arbitrary. Define $w_o = R^{-1}P$, where $R = \lim_{n\to\infty} n^{-1} \sum_{\ell=1}^{n} E(F_\ell)$, $P = \lim_{n\to\infty} n^{-1} \sum_{\ell=1}^{n} E(P_\ell)$. It is assumed that $E(F_\ell)$, $E(P_\ell)$ exist, that the above limits exist, and that $R$ is positive definite. Further, it is assumed that $n^{-1} \sum_{\ell=a+1}^{n+a} E(F_\ell)$ converges uniformly to $R$ as $n \to \infty$ for all non-negative integers $a$.

Defining $V_n = W_n - w_o$ and $C_n = P_n - F_n w_o$, we have

(2) $V_{n+1} = (I - \mu_n F_n) V_n + \mu_n C_n$.

For a sequence $\{A_\ell\}$ of $p \times p$ matrices, define

(3) $\prod_{\ell=s}^{k} A_\ell = A_k A_{k-1} \cdots A_s$ if $k \geq s$, and $\prod_{\ell=s}^{k} A_\ell = I$ if $k < s$.

Defining $Q_{k,n} = \prod_{j=k}^{n} (I - \mu_j F_j)$, $\Delta_n = \sum_{k=1}^{n} Q_{k+1,n} \mu_k C_k$, and iterating (2), one obtains

(4) $V_{n+1} = Q_{1,n} V_1 + \Delta_n$.
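The closed form (4) can be checked numerically against direct iteration of (2). The sketch below does this for small random inputs. The dimensions, the rank-one choice $F_j = xx'$, and all function names are assumptions made for illustration, and the product is ordered with the newest factor on the left, as the iteration of (2) requires.

```python
import random


def mat_mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]


def mat_vec(a, v):
    return [sum(aij * vj for aij, vj in zip(row, v)) for row in a]


def step_matrix(mu, f):
    """Return I - mu * F."""
    p = len(f)
    return [[(1.0 if i == j else 0.0) - mu * f[i][j] for j in range(p)] for i in range(p)]


def q_product(k, n, mus, fs):
    """Q_{k,n} = (I - mu_n F_n) ... (I - mu_k F_k); the identity when k > n."""
    p = len(fs[1])
    out = [[1.0 if i == j else 0.0 for j in range(p)] for i in range(p)]
    for j in range(k, n + 1):
        out = mat_mul(step_matrix(mus[j], fs[j]), out)  # newest factor on the left
    return out


def check_identity_4(n=8, p=2, seed=1):
    """Return the largest componentwise gap between iterating (2) and formula (4)."""
    rng = random.Random(seed)
    mus, fs, cs = {}, {}, {}
    for j in range(1, n + 1):
        x = [rng.gauss(0.0, 1.0) for _ in range(p)]
        mus[j] = 1.0 / j
        fs[j] = [[xi * xj for xj in x] for xi in x]  # symmetric, nonnegative definite
        cs[j] = [rng.gauss(0.0, 1.0) for _ in range(p)]
    v1 = [rng.gauss(0.0, 1.0) for _ in range(p)]
    # Direct iteration of (2): V_{j+1} = (I - mu_j F_j) V_j + mu_j C_j.
    v = v1
    for j in range(1, n + 1):
        fv = mat_vec(step_matrix(mus[j], fs[j]), v)
        v = [a + mus[j] * c for a, c in zip(fv, cs[j])]
    # Closed form (4): V_{n+1} = Q_{1,n} V_1 + sum_k Q_{k+1,n} mu_k C_k.
    closed = mat_vec(q_product(1, n, mus, fs), v1)
    for k in range(1, n + 1):
        term = mat_vec(q_product(k + 1, n, mus, fs), cs[k])
        closed = [a + mus[k] * t for a, t in zip(closed, term)]
    return max(abs(a - b) for a, b in zip(v, closed))
```

The returned gap should be at the level of floating-point rounding. Reversing the order of the factors in `q_product` would generally break the agreement for non-commuting $F_j$.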

(2.2) ASSUMPTION. The sequence $\{\mu_n\}$ is a nonincreasing sequence of positive constants, $\mu_n = O(n^{-1})$, $0 < n\mu_n < \infty$.

(2.3) ASSUMPTION. Assumptions (2.1) and (2.2) hold and $\mu_n \|F_n\| \to 0$ a.s. as $n \to \infty$.

(2.4) ASSUMPTION. Assumption (2.1) holds and $n^{-1} \sum_{k=a+1}^{a+n} F_k \to R$ a.s. as $n \to \infty$ for all positive integers $a$.

(2.5) ASSUMPTION. There exists a sequence of random integers $\{\nu_k\}$ with $1 = \nu_1 < \nu_2 < \nu_3 < \ldots$ such that, with $p_k = \nu_{k+1} - \nu_k$ and $J_k = \{\nu_k, \nu_k + 1, \ldots, \nu_{k+1} - 1\}$, we have (i) $p_k = \xi + [k^\alpha]$, for some $\alpha$, $0 \leq \alpha < 1$, (ii) $p_k^{-1} \lambda_{\min}(\sum_{j \in J_k} F_j) \geq \delta > 0$, and (iii) $p_k^{-1} \lambda_{\max}(\sum_{j \in J_k} F_j) \leq \gamma < \infty$. The quantities $\xi$, $\delta$, and $\gamma$ are all random variables that are independent of $k$.

(2.6) ASSUMPTION. Assumptions (2.1) and (2.2) hold and there exists a random variable $S \in R^p$ such that $S_n = \sum_{k=1}^{n} \mu_k C_k \to S$ a.s. as $n \to \infty$.

(2.7) ASSUMPTION. Assumptions (2.1) and (2.6) hold and $|F_n (S - S_{n-1})| \to 0$ a.s. as $n \to \infty$.

3. Almost sure convergence of $W_n$ to $w_o$.

(3.1) LEMMA. If Assumptions (2.1)-(2.5) are satisfied, then $\|Q_{1,n}\| \to 0$ a.s. as $n \to \infty$.

PROOF. For any positive integer $n$, let $K = K(n)$ be such that $n \in J_K$. Then $Q_{1,n} = Q_{\nu(K),n} Q_{1,\nu(K)-1}$, and hence $\|Q_{1,n}\| \leq \|Q_{\nu(K),n}\| \, \|Q_{1,\nu(K)-1}\|$. Defining $T_k = Q_{\nu(k),\nu(k+1)-1}$, $T_k$ may be expressed as


r = T _ i p.F.  + Z ("Dq  z * VL(i)  F?.(q-i+D» 

k ieJk  1 3 q=2  Lk,q 


, . _ fn  rT  • j >£„>...>  X-  , i-1,2 , . . ,q) , 

where  L^  - UicJk  ‘ 1 2 C 


so  that  (for  Fv(.k+i^_1Pk'5<1^ 


Pi+1 


“k  “k 


k q p 6+ 

M iii-%(k+i,-ipk5+i2(“v<wpo)  '1"pv(«i>-1  k 1 ‘ 


. . u p y a„d  the  last  equality  holds  provided  that  * 1- 

where  a - U v(k)f  k* 

,,  ..  and  (2  5)  imply  that  there  exists  a random  positive 
Assumptions  (2.2)  and  U-o/  v y 

. 1 ~t,a  1 1 r I I < 1-B  f°r  3 11  k >_  k , where 
integer  kQ  such  that  ctk  i 2 and  I I rkl  I - k ° 

_ 1 „ P J.  Similarly,  for  all  K > kc  and  ntJK>  I l<lv(K),„l  k3  2' 

2 uv(k+l)-l  k 


Kk  2 vfk+i;-i  k.  mu  v.,1 , 

It  follows  that  there  exists  a random  variable  N such  that  for  all  K_  0> 

| q | | < | M IT  (1-Bk)if  M exp (-  25  * yv(k+l)-lPk  ’ 

l>n  ' k=k  o 

o 


since $1 - x \leq e^{-x}$ for all real $x$. It is easily shown that Assumptions (2.2) and (2.5) imply that the above summation diverges to $\infty$ as $K \to \infty$. Consequently, $\|Q_{1,n}\| \to 0$ a.s. as $n \to \infty$.

(3.2) THEOREM. If Assumptions (2.1)-(2.7) are satisfied, then $|V_n| \to 0$ a.s. as $n \to \infty$.

PROOF. From (2.1.4) and Lemma (3.1), it remains only to show that $|\Delta_n| \to 0$ a.s. as $n \to \infty$. From Assumptions (2.1) and (2.6), with $S_0 = 0$ and $Q_{n+1,n} = I$, we have

(1) $\Delta_n = \sum_{k=1}^{n} (Q_{k,n} - Q_{k+1,n}) S_{k-1} + S_n = -\sum_{k=1}^{n} Q_{k+1,n} \mu_k F_k S_{k-1} + S_n$.

Defining

$B_{1,n} = \sum_{k=1}^{n} Q_{k+1,n} \mu_k F_k (S - S_{k-1})$

and

$D_n = \sum_{k=1}^{n} Q_{k+1,n} \mu_k F_k S$,

(1) may be expressed as $\Delta_n = B_{1,n} - D_n + S_n$.

From Lemma 1 of Albert and Gardner (1967, p. 189), $D_n = (I - Q_{1,n}) S$, so that $\Delta_n = B_{1,n} + (S_n - S) + Q_{1,n} S$. Since $S_n \to S$ a.s. and $\|Q_{1,n}\| \to 0$ a.s. as $n \to \infty$ (Assumption (2.6) and Lemma (3.1)), it remains to show that $|B_{1,n}| \to 0$ a.s. as $n \to \infty$.
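The decomposition $\Delta_n = B_{1,n} + (S_n - S) + Q_{1,n}S$ used in this proof is an algebraic identity that holds for any fixed vector $S$, not only for the limit in Assumption (2.6). The sketch below checks it in the scalar case $p = 1$, which is an assumption made purely to keep the example short. The random inputs and the value `s_limit` are likewise illustrative.

```python
import random


def q(k, n, mu, f):
    """Scalar Q_{k,n} = prod_{j=k}^{n} (1 - mu_j f_j); equals 1 when k > n."""
    out = 1.0
    for j in range(k, n + 1):
        out *= 1.0 - mu[j] * f[j]
    return out


def decomposition_gap(n=12, seed=2, s_limit=0.7):
    """Return |Delta_n - (B_{1,n} + (S_n - S) + Q_{1,n} S)| for scalar inputs."""
    rng = random.Random(seed)
    mu = {j: 1.0 / j for j in range(1, n + 1)}
    f = {j: rng.uniform(0.0, 2.0) for j in range(1, n + 1)}   # nonnegative "F_j"
    c = {j: rng.gauss(0.0, 1.0) for j in range(1, n + 1)}
    # Partial sums S_k = sum_{i<=k} mu_i c_i, with S_0 = 0.
    s = {0: 0.0}
    for k in range(1, n + 1):
        s[k] = s[k - 1] + mu[k] * c[k]
    delta = sum(q(k + 1, n, mu, f) * mu[k] * c[k] for k in range(1, n + 1))
    b = sum(q(k + 1, n, mu, f) * mu[k] * f[k] * (s_limit - s[k - 1])
            for k in range(1, n + 1))
    rhs = b + (s[n] - s_limit) + q(1, n, mu, f) * s_limit
    return abs(delta - rhs)
```

The identity rests on the telescoping sum $\sum_k Q_{k+1,n}\mu_kF_k = I - Q_{1,n}$, which is why the choice of `s_limit` does not affect the result.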

Using  the  same  notation  as  in  the  proof  of  Lemma  (3.1),  lB1>nl 
may  be  bounded  as 


B1  ,n  - "xv(K),n 


k=k  i=k+l  jeJ, 

O K 


k i V1  V1 

+ IIqu(K)  „ll  * a-B  il  i ,°"  rt  j Vi,v(k+i)-iujFj<s'sj-i>l 

V(-N7,n  £=k  x.  k=1  i— k*F  1 jeJk 


+ I Qk+l,nykFk(S“Sk-l)'< 


k=v(K) 


Note  that  for  all  j ,me  and  k > kQ  we  have  | |Qj>inl  I 1 1 + < 


Consequently,  defining  dR  - max  |F . (S-S | , there  exists  a random 


jeJ 


variable  such  that 


CO|  tN 


8 


K-l  K-l 


l/i  — 4 


k=k  i=k+l 

o 


\ VkA 


+ | M » (1-6,)  + | P 

i=k 

o 


Defining  a = it  (1-B„)8.,  it  remains  only  to  show  that 
n’1  t-i+i  1 1 


K-l 

y a b'1  p p , ,d  aiS'  Aas  K+»,  Clearly,  for  all  fixed 
, , K-l,k  k lk  My(k)  k 
k=k 

o 

i > k a a Vs ' 0 as  n ->  00 . From  Lemma  1 of  Albert  and  Gardner 

— o n,  i 

n n 

(1967,  p.  189),  Z | a | - 1-  n (1-6.),  which  converges  a.s.  to  1 
i-k  n>  i-k 

O O 

as  n ®.  Consequently,  by  the  Toeplitz  Lemma  (e.g.,  see  Knopp 

K-l  _!  -1 

(1947,  p.  75)),  li.  I a 9v(k)VkBk  * di"  »,(k)VA  ' 
n k=k  k-*» 

o ^ 

By  Assumptions  (2.2)  and  (2.5),  W = I ~ “ is  bounded; 

VU;  * K ° Wv(k+1)-1 

hence,  I B , I a->s‘  0 as  n -*■  °°.  ^ 

l,o 

(3.3) COROLLARY. If $\{\mu_k\}$ satisfies Assumption (2.2) and $\|F_k\|$ is a.s. uniformly bounded (in $k$), then Assumptions (2.3) and (2.7) may be deleted and Theorem (3.2) remains true.

PROOF. It suffices to consider $\mu_k = k^{-1}$. Since $|F_n (S - S_{n-1})| \leq \|F_n\| \cdot |S - S_{n-1}|$, Assumption (2.6) implies Assumption (2.7). The Borel-Cantelli Lemma and the Chebychev inequality can easily be applied to show that Assumption (2.3) is satisfied. □

4. Sufficient conditions for Assumptions (2.3)-(2.7). Several auxiliary lemmas will be needed before the main result of this section, Corollary (4.5), may be stated and proved. The following lemma is a reasonably straightforward extension of Theorem A presented by Serfling (1970). Consequently, the proof will be omitted.

(4.1) LEMMA. Let $\{x_i\}$ be a sequence of random variables, $x_i \in R^p$,

having finite "variances" $\sigma_i^2 = E[(x_i - E(x_i))'(x_i - E(x_i))]$. For integer

u a+n 

. i sir)  fnwtD  y ( t x J .S'  — L a . x •,  and.  M ~ max 

n > J do fane  X^n  - <xa+i’ ” ” a+n  ’ a,n  z=  a+ 1 %y 


a,n 


{|  1 5a,n'}-  FoT  n ~2  defin6  Xa,n  ' (xa+n+l>  ‘ * ' ' Xa>  S< 


a,n 

y • • • > I d 


a,n 


a^\).  For  \n\>l  let  Fa,n 


1 x.y  and  M = max  { |S 

t-a+n+ 1 ^ a,n  a, ,n 

denote  the  distribution  function  for  X^n  ,and  let  g (F  be  a fmctvonal 
depending  on  F . Let  aQ  be  an  arbitrary  but  fixed  integer  and  let 
V > 2.  suppose' g (F^l  + g ^ t>  i 9 ^a>k+l>  f°'  M “ - %^ni 

2 < k < k + i or  a -a±k+tfk^_-l  such  that  E(  | g ^ a n 
for  all  a >_  aQ  and  n > 1 or  aQ  - a <n  <_  - 1.  Then  E(M^ J I (log 22\n\) 

a^(F  ) for  all  a > an  and  n > 7 or  a - a <n  < -1. 

J a, n 0 u 

(4.2) LEMMA. Let $\alpha_{k,\ell} = \alpha_{\ell,k}$ and $\rho(k,\ell) = \rho(\ell,k)$ be real-valued functions defined for all non-negative integers $k$, $\ell$. For $1 \leq n \leq m$ define

(1) $\gamma_{n,m} = \sum_{k=n}^{m} \sum_{\ell=n}^{m} \alpha_{k,\ell}\, \rho(k,\ell) = 2 \sum_{u=1}^{m-n} \sum_{k=n}^{m-u} \alpha_{k,k+u}\, \rho(k,k+u) + \sum_{k=n}^{m} \alpha_{k,k}\, \rho(k,k)$.

Suppose that $|\rho(k,k+u)| = O(u^{-\nu})$ uniformly for all positive integers $k$. If $\alpha_{k,\ell} = 1$ and $0 < \nu < 1$, then for large $m-n$,

(2) $|\gamma_{n,m}| = O((m-n)^{2-\nu})$.

Finally, if $\alpha_{k,\ell} = \mu_k \mu_\ell$, $\mu_k = O(k^{-1})$, and $0 < \nu < 1$, then

(3) $|\gamma_{n,m}| = O(n^{-\nu/(\nu+2)})$.

PROOF. Suppose that $\alpha_{k,\ell} = 1$ and $|\rho(k,k+u)| = O(u^{-\nu})$, $0 < \nu < 1$, uniformly for all positive integers $k$. It suffices to consider $|\rho(k,k+u)| = (|u|+1)^{-\nu}$, in which case (from (1))

$|\gamma_{n,m}| \leq 2(m-n+1) \sum_{u=0}^{m-n} (u+1)^{-\nu} \leq 2(m-n+1) \left(1 + \int_{1}^{m-n+1} x^{-\nu}\, dx\right)$.

The result (2) follows easily by evaluating the above integral.
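The bound behind result (2) can be checked directly for the comparison function $\rho(k,k+u) = (|u|+1)^{-\nu}$ with unit weights. The sketch below evaluates the double sum in (1) by brute force and compares it with $2(m-n+1)(1 + \int_1^{m-n+1} x^{-\nu}dx)$, using the closed form of the integral. The function names and test values are illustrative assumptions.

```python
def gamma_unit_weights(n, m, nu):
    """Double sum in (1) with alpha = 1 and rho(k, l) = (|k - l| + 1)^(-nu)."""
    return sum((abs(k - l) + 1) ** (-nu)
               for k in range(n, m + 1) for l in range(n, m + 1))


def integral_bound(n, m, nu):
    """2 (m-n+1) (1 + integral of x^(-nu) from 1 to m-n+1), valid for 0 < nu < 1."""
    size = m - n + 1
    integral = (size ** (1.0 - nu) - 1.0) / (1.0 - nu)
    return 2.0 * size * (1.0 + integral)
```

Since the integral grows like $(m-n)^{1-\nu}$, the bound grows like $(m-n)^{2-\nu}$, which is the rate stated in (2).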


Suppose now that $\alpha_{k,\ell} = \mu_k \mu_\ell$, $\mu_k = O(k^{-1})$, and $|\rho(k,k+u)| = O(u^{-\nu})$, $0 < \nu < 1$, uniformly for all positive integers $k$. For this case, it suffices to consider $\mu_k = k^{-1}$ and $|\rho(k,k+u)| = (|u|+1)^{-\nu}$. Then from (1)

$|\gamma_{n,m}| \leq 2 \sum_{u=1}^{m-n} (u+1)^{-\nu} \int_{n-1}^{m-u} x^{-1} (x+u)^{-1}\, dx + (n-1)^{-1} - m^{-1}$

$= 2 \sum_{u=1}^{m-n} u^{-1} (u+1)^{-\nu} \beta_{n,m,u} + (n-1)^{-1} - m^{-1}$,

where $\beta_{n,m,u} = \ln((m-u)(n+u-1)) - \ln(m(n-1))$. If $1 \leq u \leq \ell \leq n \leq m$,


then  B _<  £M(n+£-l)  - £n(n-l).  If  1 < £ < u < n < m-n,  then 

mi,  < 2-  If  n < u < m-n,  then  0 < £n(u). 

n,m,u  — — — n,m,u  — 

Consequently,  for  all  1 <_  £ < n-2  < m-n-2 , 

|y  | < 2 £ £n  (B±izi)  + 2 £n  2 £ u”1-v 

1 n,m 1 — n-1 

u=£+l 


m-n 


+2  E u 1 V£n  u + (n-1)  ^-m  ^ . 


u=n 


Letting $\ell = n^{\theta}$, $1 > \theta > 0$, and using the fact that $\ln(1+x) \leq x$ for all $x > -1$, it follows that there exist constants $c_1$, $c_2$, and $c_3$ such that

$|\gamma_{n,m}| \leq c_1 n^{2\theta - 1} + c_2 n^{-\theta\nu} + c_3 n^{-\nu/2}$.

Finally, if $\theta = (\nu+2)^{-1}$, then $|\gamma_{n,m}| = O(n^{-\nu/(\nu+2)})$. □

(4.3) LEMMA. Define

$\gamma_F(i,j,\ell,m) = \max_{|w|=1} w' E\big((F_i - E(F_i))\, ww'\, (F_j - E(F_j))\, ww'\, (F_\ell - E(F_\ell))\, ww'\, (F_m - E(F_m))\big) w$,

where $w \in R^p$. Define $p_k(a) = a + [k^\alpha]$, where $a$ is a positive integer, $0 \leq \alpha < 1$. Define the sequence $\{\nu_k(a)\}$ by $\nu_1(a) = 1$, $\nu_{k+1}(a) = \nu_k(a) + p_k(a)$, $k = 1, 2, \ldots$, and define $J_k(a) = \{\nu_k(a), \nu_k(a)+1, \ldots, \nu_{k+1}(a) - 1\}$. If Assumption (2.4) holds and

(1) $\sum_{k=1}^{\infty} p_k(a)^{-4} \sum_{i,j,\ell,m \in J_k(a)} \gamma_F(i,j,\ell,m) < \infty$

for some $\alpha$, $0 \leq \alpha < 1$, and for some positive integer $a$, then Assumption (2.5) is satisfied.

PROOF. Define $G_k(a) = p_k(a)^{-1} \sum_{j \in J_k(a)} F_j$ and $R_k(a) = p_k(a)^{-1} \sum_{j \in J_k(a)} E(F_j)$. Let $\epsilon$ be given such that $0 < \epsilon < \lambda_{\min}(R)$. Since

$\lambda_{\min}(G_k(a)) \geq \lambda_{\min}(G_k(a) - R) + \lambda_{\min}(R)$ and $\lambda_{\max}(G_k(a)) \leq \lambda_{\max}(G_k(a) - R) + \lambda_{\max}(R)$,

it is sufficient to show that there exists a random sequence $\{\nu_k(\xi)\}$, $\xi$ an integer-valued random variable with $\xi$ a.s. finite, such that

$0 < \lambda_{\min}(R) - \epsilon \leq \lambda_{\min}(G_k(\xi) - R) + \lambda_{\min}(R) \leq \lambda_{\max}(G_k(\xi) - R) + \lambda_{\max}(R) \leq \lambda_{\max}(R) + \epsilon$

for all $k$, or, equivalently, that $\max_{|w|=1} |w'(G_k(\xi) - R)w| \leq \epsilon$ for $k = 1, 2, \ldots$. Hence, it is also sufficient to require the stronger condition that

$\max_{|w|=1} |w'(G_k(\xi) - R_k(\xi))w| + \max_{|w|=1} |w'(R_k(\xi) - R)w| \leq \epsilon$

for all $k$. By Assumption (2.1), there exists a positive integer $q_1$ such that for all sequences $\{\nu_k(a)\}$ with $a \geq q_1$ we have $\max_{|w|=1} |w'(R_k(a) - R)w| < \epsilon/2$. It follows from Assumption (2.4) that for all $n = 1, 2, \ldots$, there exists an a.s. finite random variable $\xi_n$ such that

$\max_{1 \leq k \leq n} \max_{|w|=1} |w'(G_k(\xi_n) - R_k(\xi_n))w| < \epsilon/2$.

•k 

Let  £ be  the  smallest  such  £ so  that  1 ^ 
n 

It  follows  that 

p (t  >a>q. ) <p  < ».»  Iw'iafal-^aiiwl^/a  for  some  k,  life-) 
n 1 jw|=l 

< z ^ ( max  iw'(Gk(a)-Rk(a))w|le/2) 

~ k=l  lw|=l 

< Z (|p  (a))”4  G(maxjw'(^k(a)-Rk(a))w|  ) 

k=l  2 |w| =1 

= (~)4  Z (P,  (a)"4  £ Y (ij^.m)’ 

£ k=l  k i,j,4|neJk(a)  F 

By  hypothesis  the  above  series  converges  as  n - Applying  the 

definition  of  P.U) , it  follows  that  for  all  e,  > 0,  there  exists 

a positive  integer  NCe^  such  that  for  all  a > NC^)  we  have 

lim  P(£  >a>qi)<£l.  Consequently,  £ - sup  £„  is  a.s.  finite  and 

n li  t-t 

n^co  . . U 

Assumption  (2.5)  is  satisfied. 

(4.4) LEMMA. If there exists a real-valued function $f(k,k+u) = O(|u|^{-\beta})$ uniformly for all positive integers $k$ such that

(1) $\gamma_F(i,j,\ell,m) \leq f^2(i,j) f^2(\ell,m) + f(i,j) f(i,\ell) f(j,m) f(\ell,m)$,

then Assumption (2.5) is satisfied.

PROOF. It suffices to consider $f(i,j) = (|i-j|+1)^{-\beta}$. Define the ijth element of a matrix $A$ to be $f(i,j)$. Then

$\sum_{i=a}^{b} \sum_{j=a}^{b} \sum_{\ell=a}^{b} \sum_{m=a}^{b} f(i,j) f(i,\ell) f(j,m) f(\ell,m) = \mathrm{tr}(A^4) \leq (\mathrm{tr} A^2)^2$.

The iith element of $A^2$ is given by

$(A^2)_{i,i} = \sum_{m=a}^{i-1} (i-m+1)^{-2\beta} + \sum_{m=i+1}^{b} (m-i+1)^{-2\beta} + 1 \leq c_1 (i-a+1)^{1-2\beta} + c_2 (b-i+1)^{1-2\beta}$,

for $0 < \beta < 1/2$ and some positive constants $c_1$ and $c_2$. Consequently, for some constant $c_3$, $\mathrm{tr}(A^2) \leq c_3 (b-a+1)^{2-2\beta}$, and hence $\mathrm{tr}(A^4) = O((b-a)^{4-4\beta})$. From Lemma (4.2), $(\sum_{i=a}^{b} \sum_{j=a}^{b} f^2(i,j))^2 = O((b-a)^{4-4\beta})$. Clearly, (4.3.1) will be satisfied if $\sum_{k=1}^{\infty} k^{-4\alpha} k^{4\alpha - 4\alpha\beta} < \infty$. It follows that Assumption (2.5) is satisfied for $1 > \alpha > 1/(4\beta)$. □
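The two trace facts used in this proof can be checked by brute force on a small index range. The sketch below builds the matrix with entries $f(i,j) = (|i-j|+1)^{-\beta}$, evaluates the quadruple sum term by term, and compares it with $\mathrm{tr}(A^4)$ and $(\mathrm{tr} A^2)^2$. The range $a = 1$, $b = 5$ and the value of $\beta$ are arbitrary illustrative choices.

```python
def decay_matrix(a, b, beta):
    """Matrix A with (i, j) entry f(i, j) = (|i - j| + 1)^(-beta), a <= i, j <= b."""
    idx = range(a, b + 1)
    return [[(abs(i - j) + 1) ** (-beta) for j in idx] for i in idx]


def trace_checks(a=1, b=5, beta=0.3):
    """Return (quadruple sum, tr(A^4), (tr A^2)^2) for the decay matrix."""
    A = decay_matrix(a, b, beta)
    r = range(len(A))
    # Term-by-term evaluation of sum f(i,j) f(i,l) f(j,m) f(l,m).
    quad = sum(A[i][j] * A[i][l] * A[j][m] * A[l][m]
               for i in r for j in r for l in r for m in r)
    A2 = [[sum(A[i][k] * A[k][j] for k in r) for j in r] for i in r]
    tr_a2 = sum(A2[i][i] for i in r)
    tr_a4 = sum(A2[i][j] * A2[j][i] for i in r for j in r)
    return quad, tr_a4, tr_a2 ** 2
```

The inequality $\mathrm{tr}(A^4) \leq (\mathrm{tr} A^2)^2$ holds because $A^2$ is nonnegative definite for symmetric $A$, so the sum of its squared eigenvalues cannot exceed the square of their sum.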

(4.5) COROLLARY. Suppose that Assumptions (2.1), (2.2), and (2.5) are satisfied. Define $\rho_F(k,\ell) = E(F_k F_\ell) - E(F_k)E(F_\ell)$, $\rho_C(k,\ell) = E(C_k' C_\ell) - E(C_k')E(C_\ell)$, and $\rho_{FC}(k,\ell,n) = E((C_k - E(C_k))' F_n^2 (C_\ell - E(C_\ell)))$, where all expectations are assumed to exist. Suppose that for some $\nu > 0$, $u^{\nu} \max\{\|\rho_F(k,k+u)\|, |\rho_C(k,k+u)|, |\rho_{FC}(k,k+u,n)|\}$ is uniformly bounded for all non-negative integers $k$, $u$, and $n$. Further suppose that $g_n = \sum_{k=n}^{\infty} \mu_k E(C_k)$ exists and that there exists a constant $\beta > 1$ such that $\sum_{n=1}^{\infty} |g_n|^{\beta} E(\|F_n\|^{\beta}) < \infty$. Then $|V_n| \to 0$ a.s. as $n \to \infty$.

PROOF. First consider Assumption (2.4). Define $S_{a,n} = \sum_{k=a+1}^{a+n} (F_k - E(F_k)) w$, where $w \in R^p$. Clearly, Assumption (2.4) is satisfied iff $n^{-1} |S_{a,n}| \to 0$ a.s. for all positive integers $a$ as $n \to \infty$. Define

n = ™ax  {|Sa  ^ | , . . . , | }.  Let  {n^}  be  an  increasing  sequence 

)f  positive  integers  such  that  n.  -►  °°  as  k -+  For  all  n,  < n < n,  -1 

k k — — k+1 


-1 


-1 


<D  11  K J<n,/|S 


, -1 

a,iJ— nk  ^a,n(k)-l'  nk  a+n(k)-l ,n (k+l)-n(k) ' 

To  apply  Lemma  (4.1),  define  g(F  ) as 

a ,n 

2 a+n  a+n 

(2)  g(Fa  n}  = lwl  E z ||pf(M)||-  0 (n  V) , 0<v<l, 

’ k=a+l  £=a+l 

uniformly  for  all  positive  integers  a,  from  Lemma  (4.2).  It  is 

easily  seen  that  g(F  ) satisfies  the  needed  conditions  of  Lemma 

a ,n 

(4.1).  Letting  n^=k  , n^  E(|Sa  n(k)l^)  ^-s  summable  for  all  a>v 
The  Chebychev  inequality  and  Borel-CanteLli  Lemma  thus  imply  that 

"k  ° «•  * - »•  lectins  5k  - n(k+J_n(k)i 

(3)  nk2E(F,^)  = 0(nkV(log2  2 >2>  ■ 

from  (2)  and  Lemma  (4.1).  Substituting  n^  = ka  into  (3),  the  Borel- 

Cantelli  Lemma  and  the  Chebychev  inequality  imply  that  n a-Vs"  q 

1c  k 

as  k ->  oo  for  a > v ■*".  Consequently,  from  (1),  Assumption  (2.4)  is 
satisfied. 

Now  consider  Assumption  (2.6).  Define  S 


a 

Z 


. »k(Ck-E(Ck»' 

k=a+m+l 


M 


a , m 


= max{ I S 


a . m 


|S  , | ),  and  g(F  )=  Z Z 

a > t a,m  , , , . 

k=a+m+l  £=a  + 1 


Vilpc<k>t) 


It  is  easily  seen  that  g(F  ) satisfies  the  needed  conditions  of 

cj  j m 

Lemma (4.1). From Lemma (4.2), $E(|S_{a,n-a-1}|^2) = O(n^{-\nu/(\nu+2)})$ for all $a > n$. With $n_k = k^{\alpha}$,


the  Chebychev  inequality  and  the  Borel-Cantelli  Lemma  thus  imply  that 

q-S  -g  = E p.(C.  - E ( C . ) ) 3+S  ‘ 0 as  k -»■  00  for  a>v  (v+2)  . 

n(k)-l  n(k)  1 1 t 

k 

Consequently,  S-S^k^_^  0 as  k + For  all  £ n 

<“>  ls-s„-ilils-sn(k+1)-1l  + 1 + bk' 

«k  ' Mn(k+1)-1 ,n(k)-n(k+l)  and  bk  is  a MqUenCe  °f  P°Sitlve  C°"_ 
stanl  s converging  to  zero  as  k -+  00 . From  Lemmas  (4.1)  and  (4.2), 

(5)  E(sJ)  = 0((log2  2(nk+1-nk))2n;v/(V+2)). 

Ct 

Substituting  n = k into  (5),  the  Chebychev  inequality  and  the  Borel- 

Cantelli  Lemma  imply  that  £k  ->  *0  as  k -*■  00  for  all  a > v (v+2). 

Consequently,  Assumption  (2.6)  is  satisfied. 

Finally,  consider  Assumption  (2.7).  Define  Z = F (C,-E(C,)), 

K y n n k.  k. 

a a a 

S = £ M Z 1,  and  g(F  ) * I Z p.p  |p  (k,£,a+m+l)  | . 

a,m  , ■ , 1 k k,a+m+l  ” a, nr  , ...  k £'  FC 

k=a+m+l  k=a+m+l  £=a+m+l 

Proceeding  as  above,  E ( | S _ 1 |2)  = 0(n  2^)  for  all  a > n. 

a y n a j. 

With  n = ka,  a > v F(v+2),  we  have 


Fn(k)(S  Sn(k)-1)  Fn(k)8n(k)  l Mi  Zi,n(k)  * °* 

1 II, 


as  k-*».  Since 


00  n 

E |g  I iSE(  | | F ||°)<«>,  the  Markov  inequality  and  the 


a ■ s ■ 

Borel-Cantelli  Lemma  imply  that  F^g^  4-  ' -<°.  Consequently,  for 

al  1 n,  <n<n,  , -1 , 
k k+1 


lFn(S-Sn-J)^Fn(k+l)(S-Sn(k+l)-l)l+?k  + \* 


’k  n(k+l)-l,n(k)-n(k+l) 


and  4',  a^s.  0 as  k-*».  Since  EL 


satisfies (5), $\xi_k \to 0$ a.s. as $k \to \infty$. Consequently, Assumption (2.7) is satisfied.

Finally, $\beta > 1$ and $\mu_n = O(n^{-1})$ easily provide that $\mu_n \|F_n\| \to 0$ a.s. by the Markov inequality and the Borel-Cantelli Lemma, and hence, Assumption (2.3) is satisfied.

Consequently, from Theorem (3.2), $|V_n| \to 0$ a.s. as $n \to \infty$. □

(4.6) REMARK. An important ancillary result contained in Corollary (4.5) is that sufficient conditions for the strong consistency of the usual sample covariance function are provided. For example, let $\{x_k\}$ be a zero mean wide-sense stationary real-valued normal random process and define $\rho_x(v) = E(x_k x_{k+v})$. Consider $F_k = x_k x_{k+v}$. It is easily shown that for this case, $|\rho_F(k,k+u)| = |\rho_x^2(u) + \rho_x(u+v)\rho_x(u-v)|$. The proof of Corollary (4.5) shows that if $\rho_x(u) = O(u^{-\nu})$ for $\nu > 0$, then $n^{-1} \sum_{k=1}^{n} x_k x_{k+v} \to \rho_x(v)$ a.s. as $n \to \infty$. A similar result, presented as Theorem 8B of Parzen (1961), states that $n^{-1} \sum_{k=1}^{n} x_k x_{k+|v|} \to \rho_x(v)$ a.s. as $n \to \infty$ provided that there exist positive constants $c$, $q$ such that $n^{-1} \sum_{u=1}^{n} \rho_x^2(u) \leq c\, n^{-q}$ for all positive integers $n$. The conditions of Parzen (1961) are clearly satisfied by $\rho_x(u) = O(u^{-\nu})$ and $\nu > 0$.
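The covariance expression for the lagged products in this remark follows from the standard fourth-moment formula for zero-mean jointly normal variables, $E(z_1z_2z_3z_4) = \sigma_{12}\sigma_{34} + \sigma_{13}\sigma_{24} + \sigma_{14}\sigma_{23}$. The sketch below applies that formula to $x_k x_{k+v}$ and $x_{k+u} x_{k+u+v}$ and compares the result with $\rho_x^2(u) + \rho_x(u+v)\rho_x(u-v)$. The geometric covariance $\rho_x(h) = 0.6^{|h|}$ is an assumed example, not a model from the paper.

```python
def rho(h, a=0.6):
    """Assumed stationary covariance function rho_x(h) = a^|h|."""
    return a ** abs(h)


def fourth_moment(t1, t2, t3, t4):
    """E(x_t1 x_t2 x_t3 x_t4) for a zero-mean normal process, via the pairing formula."""
    return (rho(t1 - t2) * rho(t3 - t4)
            + rho(t1 - t3) * rho(t2 - t4)
            + rho(t1 - t4) * rho(t2 - t3))


def cov_of_products(k, u, v):
    """Covariance of x_k x_{k+v} and x_{k+u} x_{k+u+v}."""
    return fourth_moment(k, k + v, k + u, k + u + v) - rho(v) * rho(v)


def closed_form(u, v):
    """rho_x(u)^2 + rho_x(u+v) rho_x(u-v)."""
    return rho(u) ** 2 + rho(u + v) * rho(u - v)
```

Because the process is stationary, `cov_of_products` does not depend on `k`. The argument is kept only to mirror the notation $\rho_F(k,k+u)$.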

(4.7) REMARKS. Recall that sufficient conditions for Assumption (2.5) have been presented as Lemmas (4.3) and (4.4). Lemma (4.4) is useful for several specific choices of $\{F_n\}$, as shown below in Section 5. In case $\{\|F_n\|\}$ is bounded, or if $\{F_n\}$ and/or $\{P_n\}$ are deterministic, then the conditions of Corollary (4.5) are simplified, as shown below in Corollaries (4.8)-(4.11).

(4.8) COROLLARY. Let $\rho_F$, $\rho_C$, and $g_n$ be as in Corollary (4.5). Suppose that Assumptions (2.1), (2.2), and (2.5) are satisfied, and that $\{\|F_k\|\}$ is bounded. If there exists a $\nu > 0$ such that $u^{\nu} \max\{\|\rho_F(k,k+u)\|, |\rho_C(k,k+u)|\}$ is uniformly bounded for all non-negative integers $k$ and $u$, and there exists a $\beta > 0$ such that $\sum_{n=1}^{\infty} |g_n|^{\beta} < \infty$, then $|V_n| \to 0$ a.s. as $n \to \infty$.

PROOF. Simply apply Corollary (3.3) to Corollary (4.5). □

(4.9) COROLLARY. Suppose that $\{F_n\}$ is deterministic, and that Assumptions (2.1) and (2.2) are satisfied. Define $\rho_P(k,\ell) = E(P_k' P_\ell) - E(P_k')E(P_\ell)$, and $g_n = \sum_{k=n}^{\infty} \mu_k (E(P_k) - F_k w_o)$. If there exists a $\nu > 0$ such that $u^{\nu} |\rho_P(k,k+u)|$ is uniformly bounded for all non-negative integers $k$ and $u$, and if $g_n \to 0$ as $n \to \infty$, then $|V_n| \to 0$ a.s. as $n \to \infty$.

PROOF. An obvious consequence of Corollary (4.5). □

(4.10) COROLLARY. Suppose that $\{P_n\}$ is deterministic, and that Assumptions (2.1), (2.2), and (2.5) are satisfied. Define $\rho_{FF}(k,\ell,n) = E((F_k - E(F_k)) F_n^2 (F_\ell - E(F_\ell)))$ and $g_n = \sum_{k=n}^{\infty} \mu_k (P_k - E(F_k) w_o)$. If there exists a $\nu > 0$ such that $u^{\nu} \max\{\|\rho_F(k,k+u)\|, \|\rho_{FF}(k,k+u,n)\|\}$ is uniformly bounded for all non-negative integers $k$, $u$, and $n$, and there exists a $\beta > 1$ such that $\sum_{n=1}^{\infty} |g_n|^{\beta} E(\|F_n\|^{\beta}) < \infty$, then $|V_n| \to 0$ a.s. as $n \to \infty$.

PROOF. Follows directly from Corollary (4.5). □

(4.11) COROLLARY. Suppose that both $\{F_n\}$ and $\{P_n\}$ are deterministic, and that Assumptions (2.1) and (2.2) are satisfied. If $\sum_{k=1}^{\infty} \mu_k (P_k - F_k w_o)$ exists, then $|V_n| \to 0$ as $n \to \infty$.

PROOF. Trivial case of Corollary (4.5). □


5. Application of Corollary 4.5.

(5.1) Special families of $F_n$ and $P_n$. Let $\{X_j\}_{j=-\infty}^{\infty}$ be a sequence of $R^p$-valued zero-mean random variables and let $\{s_j\}_{j=-\infty}^{\infty}$ be a sequence of real-valued zero-mean random variables. Define $R_{xx}(k,\ell) = E(X_k X_\ell')$, $P_{sx}(k,\ell) = E(s_k X_\ell)$ and $\rho_s(k,\ell) = E(s_k s_\ell)$. Suppose that $R_{xx}(k,k+u)$, $P_{sx}(k,k+u)$, and $\rho_s(k,k+u)$ are periodic in $k$ with period $N$. Define $R = N^{-1} \sum_{k=1}^{N} R_{xx}(k,k)$ and $P = N^{-1} \sum_{k=1}^{N} P_{sx}(k,k)$. It will become apparent in what follows that $R$ and $P$ satisfy Assumption (2.1).

Suppose that it is desired to choose $w \in R^p$ to minimize $\xi(w) = \sum_{k=1}^{N} E((s_k - w'X_k)^2)$. Such problems arise frequently in adaptive transversal filter channel equalization in digital communications. When $N = 1$, the problem reduces to the use of jointly wide-sense stationary sequences $\{s_j\}$ and $\{X_j\}$. Assuming that $R$ is positive definite, the desired solution, $w_o$, is given by $w_o = R^{-1}P$. Assume now that $R$ and/or $P$ are unknown, and that it is desired to use algorithm (2.1.1), with $F_n$ and $P_n$ functions of the observed time series, $\{X_j\}$ and $\{s_j\}$, to recursively estimate $w_o$. Obvious candidates for $F_n$ and $P_n$ are

(1) $F_n = K_n^{-1} \sum_{j=n-K_n+1}^{n} X_j X_j'$,

and

(2) $P_n = K_n^{-1} \sum_{j=n-K_n+1}^{n} s_j X_j$,

where $K_n$ is a positive integer; e.g. $K_n = 1$, $K$, or $n$. In fact, algorithms represented here by (2.1.1) with $F_n$ and $P_n$ given by (1) and (2) have frequently appeared in the engineering literature for consideration in a wide range of applications.
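The windowed estimates (1) and (2) are simple running averages, and a short sketch may make the indexing explicit. The function name `window_estimates`, the list-based storage with `xs[j-1]` holding $X_j$, and the restriction $K_n \leq n$ (the observed series here starts at $j = 1$ rather than extending to $-\infty$) are assumptions made for this example.

```python
def window_estimates(xs, ss, n, k_n):
    """Return (F_n, P_n) averaged over j = n - K_n + 1, ..., n.

    xs[j-1] holds the vector X_j and ss[j-1] the scalar s_j (1-based time index).
    """
    if not 1 <= k_n <= n <= len(xs):
        raise ValueError("need 1 <= K_n <= n <= number of observations")
    p = len(xs[0])
    f = [[0.0] * p for _ in range(p)]
    pv = [0.0] * p
    for j in range(n - k_n + 1, n + 1):
        x, s = xs[j - 1], ss[j - 1]
        for a in range(p):
            pv[a] += s * x[a] / k_n              # contribution to P_n
            for b in range(p):
                f[a][b] += x[a] * x[b] / k_n     # contribution to F_n = average of X_j X_j'
    return f, pv
```

For example, with `xs = [[1, 0], [0, 2], [1, 1]]`, `ss = [1, 2, 3]`, `n = 3` and `k_n = 2`, the function averages over $j = 2, 3$ and returns `([[0.5, 0.5], [0.5, 2.5]], [1.5, 3.5])`. Setting `k_n = 1` gives the instantaneous estimates $X_nX_n'$ and $s_nX_n$.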


Note that if $K_n = K$ (a constant), then $E(F_n)$, $E(P_n)$, and $E(C_n)$ are all periodic (in $n$) with period $N$. Furthermore, for any $n > 0$, $N^{-1} \sum_{k=n}^{n+N-1} E(C_k) = 0$. If, in addition, $\{\mu_k\}$ satisfies either $\mu_k = a([k/N] + b)^{-1}$ or $\mu_k = a(k+b)^{-1}$, where $a > 0$ and $b \geq 0$, then it can be shown that $|g_n| = |\sum_{k=n}^{\infty} \mu_k E(C_k)| = O(n^{-1})$. If, for example, $E(\|F_n\|^2)$ is bounded (in $n$), then $\sum_{n=1}^{\infty} |g_n|^2 E(\|F_n\|^2) < \infty$, thus satisfying the conditions on $\{g_n\}$ stated in Corollaries (4.5) and (4.8)-(4.10). Of course, many other choices of $\{\mu_k\}$ are permissible. Finally, if $K = N$, then $E(C_k) = 0$, and the conditions on $\{g_n\}$ are a fortiori satisfied.

The remaining conditions on $\{F_n\}$ and $\{P_n\}$ stated in the preceding corollaries are quite mild "asymptotic covariance decay rate" conditions. The strongest of these decay rate conditions is that imposed on $\gamma_F$ via Lemma (4.3) in order to satisfy Assumption (2.5).

Regarding the covariance decay rate conditions in the corollaries of Section 4, it may be helpful to note that

(3) $\rho_C(k,\ell) = \rho_P(k,\ell) + w_o' \rho_F(k,\ell) w_o - \rho_{PF}(k,\ell) w_o - \rho_{PF}(\ell,k) w_o$,

where $\rho_P$ and $\rho_F$ are defined in Corollaries (4.9) and (4.5), respectively, and $\rho_{PF}(k,\ell) = E(P_k' F_\ell) - E(P_k')E(F_\ell)$. Furthermore, $\rho_{FC}$ can be expressed as

(4) $\rho_{FC}(k,\ell,n) = \rho_{PFP}(k,\ell,n) + w_o' \rho_{FF}(k,\ell,n) w_o - \rho_{PFF}(k,\ell,n) w_o - \rho_{PFF}(\ell,k,n) w_o$,

where $\rho_{FF}$ is defined in Corollary (4.10), $\rho_{PFP}(k,\ell,n) = E((P_k - E(P_k))' F_n^2 (P_\ell - E(P_\ell)))$, and $\rho_{PFF}(k,\ell,n) = E((P_k - E(P_k))' F_n^2 (F_\ell - E(F_\ell)))$. The results (3) and (4) simply express the conditions on $\{F_n\}$ and $\{C_n\}$ stated in the corollaries of Section 4 in terms of similar conditions on $\{F_n\}$ and $\{P_n\}$.
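Identity (3) is a consequence of the bilinearity of covariance and the symmetry of $F_k$, so it also holds exactly when expectations are replaced by averages over any finite sample. The sketch below checks this with empirical averages. The data model ($F = XX'$, $P = sX$), the vector `w_o`, and the helper names are illustrative assumptions. Since $F_k$ is symmetric, $w_o'\rho_F(k,\ell)w_o$ is computed as the covariance of the vectors $F_kw_o$ and $F_\ell w_o$.

```python
import random


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def mean_vec(vs):
    return [sum(v[i] for v in vs) / len(vs) for i in range(len(vs[0]))]


def cov(us, vs):
    """Empirical E(u'v) - E(u)'E(v) over paired samples."""
    return sum(dot(u, v) for u, v in zip(us, vs)) / len(us) - dot(mean_vec(us), mean_vec(vs))


def identity_3_gap(samples=200, seed=3):
    """Return |rho_C - (rho_P + w'rho_F w - rho_PF(k,l) w - rho_PF(l,k) w)| empirically."""
    rng = random.Random(seed)
    w_o = [0.8, -0.3]  # any fixed vector works for the algebraic identity
    pk, pl, ak, al = [], [], [], []
    for _ in range(samples):
        xk = [rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)]
        xl = [0.5 * xk[0] + rng.gauss(0.0, 1.0), 0.5 * xk[1] + rng.gauss(0.0, 1.0)]
        sk, sl = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        pk.append([sk * x for x in xk])
        pl.append([sl * x for x in xl])
        # F w_o for F = X X' equals X (X' w_o).
        ak.append([x * dot(xk, w_o) for x in xk])
        al.append([x * dot(xl, w_o) for x in xl])
    ck = [[p - a for p, a in zip(pv, av)] for pv, av in zip(pk, ak)]  # C_k = P_k - F_k w_o
    cl = [[p - a for p, a in zip(pv, av)] for pv, av in zip(pl, al)]
    lhs = cov(ck, cl)
    rhs = cov(pk, pl) + cov(ak, al) - cov(pk, al) - cov(pl, ak)
    return abs(lhs - rhs)
```

The gap is zero up to rounding for any sample, which confirms that (3) imposes no additional condition beyond those on the individual covariance functions.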

In view of the widespread consideration of algorithms fitting the
framework of (2.1.1), (1), and (2), it is of interest to reduce the moment
conditions on $\{F_n\}$ and $\{P_n\}$ stated in the corollaries of Section 4 to
moment conditions on $\{X_i\}$ and $\{s_j\}$. In case $\{X_i\}$ and $\{s_j\}$ are
normally distributed, greatly simplified conditions can be established for
this family of algorithms, as shown in Section (5.2).
(5.2) The normal case. Assume the same notation and structure as
in Section 5.1. Further, assume that all random variables involved are
jointly normally distributed. Then all of the "covariance decay rate
conditions" for $\{F_n\}$ and $\{P_n\}$ stated in the corollaries of Section 4
and Section 5.1 can be expressed in terms involving the covariance functions
of elements of $\{X_i\}$ and $\{s_j\}$. In order to accomplish this, some
properties of joint moments of normally distributed random variables are
required.

Let $z_1, z_2, \ldots, z_8$ be zero-mean jointly normally distributed with
$E(z_i z_j) = \sigma(i,j)$. Using the properties of the characteristic function
of $z_1, z_2, \ldots, z_8$, it is straightforward to show that

(1)  $E\left(\prod_{i=1}^{4} (z_{2i-1} z_{2i} - \sigma(2i-1, 2i))\right) = \sum \sigma(\ell_1,\ell_2)\, \sigma(\ell_3,\ell_4)\, \sigma(\ell_5,\ell_6)\, \sigma(\ell_7,\ell_8)$,

where the summation is over all possible ways of combining
$\ell_1, \ell_2, \ldots, \ell_8 \in \{1,2,\ldots,8\}$, $\ell_i \ne \ell_j$ for $i \ne j$, into four
distinct unordered pairs, no pair of which is (1,2), (3,4), (5,6), or (7,8).
There are sixty terms in the sum.
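The count of sixty terms, and the moment identity itself, can be checked by brute-force enumeration. The following is a minimal sketch: the covariance function `sigma` is a hypothetical choice ($0.5^{|i-j|}$) used only to produce numbers, and the left-hand side is evaluated by expanding the product and applying the standard pairing formula for moments of zero-mean jointly normal variables.

```python
from itertools import combinations
from math import prod

BLOCKS = [(1, 2), (3, 4), (5, 6), (7, 8)]   # the pairs excluded from the sum
FORBIDDEN = set(BLOCKS)


def pairings(items):
    """Yield every way of splitting the sorted list `items` into unordered pairs."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for idx, partner in enumerate(rest):
        remaining = rest[:idx] + rest[idx + 1:]
        for tail in pairings(remaining):
            yield [(first, partner)] + tail


def restricted_pairings():
    """Pairings of 1..8 containing none of (1,2), (3,4), (5,6), (7,8)."""
    return [p for p in pairings(list(range(1, 9)))
            if not FORBIDDEN.intersection(p)]


def sigma(i, j):
    """Hypothetical covariance E(z_i z_j), chosen only for illustration."""
    return 0.5 ** abs(i - j)


def gaussian_moment(indices):
    """E(prod z_i) for zero-mean jointly normal z: sum over all pairings."""
    return sum(prod(sigma(a, b) for a, b in p) for p in pairings(list(indices)))


def lhs():
    """Expand prod_i (z_{2i-1} z_{2i} - sigma(2i-1, 2i)) and take expectations."""
    total = 0.0
    for r in range(len(BLOCKS) + 1):
        for kept in combinations(BLOCKS, r):
            dropped = [blk for blk in BLOCKS if blk not in kept]
            sign = (-1) ** len(dropped)
            const = prod(sigma(a, b) for a, b in dropped)
            indices = [i for blk in kept for i in blk]
            total += sign * const * gaussian_moment(indices)
    return total


def rhs():
    """Sum of products of covariances over the restricted pairings."""
    return sum(prod(sigma(a, b) for a, b in p) for p in restricted_pairings())


if __name__ == "__main__":
    print("number of terms:", len(restricted_pairings()))
    print("lhs - rhs:", lhs() - rhs())
```

The count also follows from inclusion-exclusion on the 105 pairings of eight items: $105 - 4 \cdot 15 + 6 \cdot 3 - 4 \cdot 1 + 1 = 60$.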


With $F_n$ given by (5.1.1) and $K_n = K$, and defining
$f(k,\ell) = \max_{1 \le i,j \le p} |(R_{xx}(k,\ell))_{i,j}|$, (1) can be applied to
show that the conditions of Lemma (4.4) are satisfied if for some $\nu > 1/4$,

$u^{\nu} \max_{1 \le i,j \le p} |(R_{xx}(k,k+u))_{i,j}|$

is uniformly bounded for all non-negative integers $k$ and $u$. Similarly,
if for some $\nu > 0$,
$u^{\nu} \max_{1 \le \ell \le p} \{ |(R_{xs}(k,k+u))_{\ell}|, |R_{ss}(k,k+u)| \}$

is uniformly bounded for all non-negative integers $k$ and $u$, then the
decay rate conditions stated in Corollary (4.5) on $\rho_F$, $\rho_C$, and
$\rho_{FC}$ are all satisfied. Consequently, each member of the family of
algorithms represented by (2.1.1) with $F_n$ and $P_n$ given by (5.1.1) and
(5.1.2), and $K_n = K$, converges a.s. to $w_0$ provided that, in addition,
$\{\mu_k\}$ and $K$ are chosen as discussed in Section 5.1.


For example, let $\{x_k : k \in \{0, \pm 1, \ldots\}\}$ be a real-valued zero-mean
wide-sense stationary autoregressive-moving average (ARMA) normal random
process. With $X_k = (x_k, x_{k-1}, \ldots, x_{k-p+1})'$ and $s_k = x_{k+a}$
(integer $a$), the required decay rates of (5.1.3) are easily established.
Hence, each member of the family of algorithms represented by (2.1.1) with
$F_n$ and $P_n$ given by (5.1.1) and (5.1.2), $K_n = K$, and $\{\mu_k\}$
satisfying Assumption (2.2) converges a.s. to $w_0$.
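The example above can be illustrated with a small simulation of the recursion $W_{n+1} = W_n + \mu_n (P_n - F_n W_n)$ on correlated data. This is a sketch under assumptions that are not taken from the text: the process is a first-order autoregression (a special case of ARMA) with coefficient `phi`, $p = 2$, $a = 1$, the instantaneous estimates are taken as $F_n = X_n X_n'$ and $P_n = s_n X_n$, and the step size is $\mu_n = a/(n+b)$ with illustrative constants. Under these assumptions the solution of $Rw = P$ is $w_0 = (\phi, 0)'$.

```python
import random


def simulate(num_steps=100_000, phi=0.5, a=1.0, b=20.0, seed=0):
    """Run W_{n+1} = W_n + mu_n (P_n - F_n W_n) on AR(1) training data.

    Assumptions of this sketch (hypothetical, for illustration):
      x_{k+1} = phi * x_k + e_k with unit-variance normal e_k,
      X_n = (x_n, x_{n-1})', s_n = x_{n+1},
      F_n = X_n X_n', P_n = s_n X_n, mu_n = a / (n + b).
    """
    rng = random.Random(seed)
    x_prev, x_curr = 0.0, rng.gauss(0.0, 1.0)
    w = [0.0, 0.0]
    for n in range(1, num_steps + 1):
        x_next = phi * x_curr + rng.gauss(0.0, 1.0)
        regressor = (x_curr, x_prev)          # X_n
        target = x_next                       # s_n
        mu = a / (n + b)
        # P_n - F_n W_n = X_n (s_n - X_n' W_n)
        error = target - (regressor[0] * w[0] + regressor[1] * w[1])
        w[0] += mu * error * regressor[0]
        w[1] += mu * error * regressor[1]
        x_prev, x_curr = x_curr, x_next
    return w


if __name__ == "__main__":
    print("estimate of w_0:", simulate())
```

Successive regressors share a component and the process itself is autocorrelated, so the training data here form a correlated sequence of the kind the covariance decay conditions are meant to cover.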


6. A simple almost sure convergence result.

(6.1) THEOREM. Suppose that Assumption (2.1) holds and that there
exist sequences $\{a_n\}$ and $\{b_n\}$ of non-negative real numbers (possibly
random) satisfying $\|F_n - R\| \le a_n$ and $|F_n w_0 - P_n| \le b_n$. Furthermore, suppose

that there exists a positive integer $n_0$ (possibly random) such that for


all  n>no,  0 <un  ^Ot)*!- 


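Then for all $n \ge n_0$,

(1)  $|V_{n+1}| \le |V_{n_0}| \prod_{k=n_0}^{n} (1 - \mu_k d_k) + \max_{n_0 \le k \le n} (b_k/d_k) \left(1 - \prod_{k=n_0}^{n} (1 - \mu_k d_k)\right)$,

where $d_k = \lambda_{\min}(R) - a_k$. Furthermore, if $\sum_{k=1}^{\infty} \mu_k d_k = \infty$ a.s. and
$b_n/d_n \to 0$ a.s. as $n \to \infty$, then $|V_n| \to 0$ a.s. as $n \to \infty$.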
PROOF. From Assumption (2.1),

$V_{n+1} = V_n - \mu_n R V_n - \mu_n (F_n V_n + F_n w_0 - P_n - R V_n)$,

so that for all $n \ge n_0$,

(2)  $|V_{n+1}| \le (1 - \mu_n \lambda_{\min}(R)) |V_n| + \mu_n \|F_n - R\| \, |V_n| + \mu_n |F_n w_0 - P_n| \le (1 - \mu_n d_n) |V_n| + \mu_n b_n$.

Iterating (2), for all $n \ge n_0$,

(3)  $|V_{n+1}| \le |V_{n_0}| \prod_{k=n_0}^{n} (1 - \mu_k d_k) + \sum_{k=n_0}^{n} \prod_{j=k+1}^{n} (1 - \mu_j d_j) \, \mu_k d_k (b_k d_k^{-1})$.

Since all terms appearing in the sum in (3) are a.s. non-negative, (1)
follows immediately from (3) with the aid of Lemma 1 of Albert and
Gardner (1967, p. 189). Furthermore, if $\sum_{k=1}^{\infty} \mu_k d_k = \infty$ a.s. and
$b_n d_n^{-1} \to 0$ a.s. as $n \to \infty$, then (3) and the Toeplitz Lemma show
that $|V_n| \to 0$ a.s. as $n \to \infty$.
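The step from the recursive inequality (2) to a closed-form bound can be checked numerically. The following is a minimal sketch with hypothetical deterministic sequences ($\mu_k = 1/(k+1)$, $d_k = 0.5$, $b_k = 1/\sqrt{k+1}$, $n_0 = 1$), chosen so that $0 < \mu_k d_k < 1$, $\sum \mu_k d_k = \infty$, and $b_k/d_k \to 0$; the requirement $0 < \mu_k d_k < 1$ is an assumption of this sketch. It iterates the equality case of (2) and compares each iterate with the bound obtained by replacing every $b_k/d_k$ in (3) by its running maximum.

```python
from math import sqrt


def check_bound(num_steps=100_000, v0=5.0):
    """Iterate the worst case of (2) and compare it with a closed-form bound.

    Hypothetical sequences (illustration only): mu_k = 1/(k+1), d_k = 0.5,
    b_k = 1/sqrt(k+1), starting at n_0 = 1.
    Returns the final iterate and whether the bound held at every step.
    """
    v = v0
    product = 1.0       # running prod_{k=n0}^{n} (1 - mu_k d_k)
    max_ratio = 0.0     # running max_{n0 <= k <= n} b_k / d_k
    bound_holds = True
    for k in range(1, num_steps + 1):
        mu = 1.0 / (k + 1)
        d = 0.5
        b = 1.0 / sqrt(k + 1)
        v = (1.0 - mu * d) * v + mu * b      # equality case of (2)
        product *= 1.0 - mu * d
        max_ratio = max(max_ratio, b / d)
        bound = v0 * product + max_ratio * (1.0 - product)
        # small tolerance for floating-point rounding
        bound_holds = bound_holds and (v <= bound + 1e-12)
    return v, bound_holds


if __name__ == "__main__":
    final_value, ok = check_bound()
    print("final iterate:", final_value, "bound held:", ok)
```

The bound relies on the telescoping identity $\sum_{k=n_0}^{n} \prod_{j=k+1}^{n} (1 - \mu_j d_j) \mu_k d_k = 1 - \prod_{k=n_0}^{n} (1 - \mu_k d_k)$, and the decay of the iterate toward zero reflects the Toeplitz Lemma argument in the proof.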

(6.2) Remark. Theorem (6.1) is applicable to the case discussed
in Section 5 with $K_n = K$.